Method and apparatus for reducing mosquito noise in decoded video sequence

ABSTRACT

A ringing area detector classifies the input image into two regions: a mosquito noise region (i.e. filtering region) and a non-mosquito noise region (i.e. non-filtering region), and uses this classification information to adaptively remove the mosquito noise in a mosquito noise reduction system. The mosquito noise reduction system includes a ringing area detector, a local noise power estimator, a smoothing filter, and a mixer. The ringing area detector includes an edge detector, a near edge detector, a texture detector, and a filtering region decision block. The ringing detection block detects the ringing area where the smoothing filter is to be applied. The local noise power estimator controls the filter strength of the smoothing filter. The smoothing filter smoothes the input image. The mixer mixes the smoothed image and the original image properly based on the region information from the ringing area detection block.

FIELD OF THE INVENTION

The present invention relates generally to image or video postprocessing, and more particularly to mosquito noise reduction in decodedvideo sequence.

BACKGROUND OF THE INVENTION

Many popular image and video compression standards, such as JPEG andMPEG2, are based on Discrete Cosine Transform (DCT) processing. Thebasic approach of the DCT-based image and video compression technique isto subdivide the image into 8×8 blocks and then individually transform,quantize, and encode each block. In DCT-based compression techniquesartifacts may occur, specially at low and moderate bit rates. Amongseveral well-known artifacts, one is typically known as mosquito noiseor ringing noise, which mostly appears in image homogeneous regions nearstrong edges. Mosquito noise is caused by loss of high frequencytransform coefficients during the quantization step.

In recent years, several techniques have been proposed for mosquitonoise reduction. Generally speaking, there are two types of techniquesfor mosquito noise reduction. One uses coding parameters to control thefiltering process. The other does not rely on the coding information,and is usually called post-processing. The first type techniques requirethe decoder to send out some coding parameters along with the decodedvideo sequence, which in many practical situations are impossible. Assuch, applications for the first type of noise reduction techniques arelimited. The second type techniques usually apply some adaptivefiltering methods in order to avoid damaging image structures. Among thesecond type techniques, a technique proposed in the patent applicationWO 02/102086 uses a segmentation map to guide the filtering process. Itclassifies the image into five regions and assigns different regionswith different filtering weights. FIG. 1 shows a block diagram of theimage classifier 100 used in this method. However, classifying the imageinto five regions and using several cascaded filters, increases hardwarecomplexity and cost.

BRIEF SUMMARY OF THE INVENTION

The present invention addresses the above shortcomings. An object of thepresent invention is to provide a simple ringing (mosquito noise) regiondetection method that creates an accurate filtering map to guide thesmoothing filter to effectively reduce mosquito noise.

Another object of the present invention is to provide a noise reductionapparatus for effectively reducing mosquito noise in decoded videosequence associated with the ringing region detection.

In one embodiment the present invention provides a ringing areadetection method that classifies the input image into two regions: amosquito noise region (i.e. filtering region) and a non-mosquito noiseregion (i.e. non-filtering region), and uses this classificationinformation to adaptively remove the mosquito noise. Compared toaforementioned prior art, the present invention provides simpler ways ofimage segmentation for the purpose of mosquito noise reduction. Thepresent invention further provides a method of avoiding abrupt changesbetween different regions and possible flicker, by providingsoft-switching between filtering region and non-filtering region.

The present invention further provides a mosquito noise reduction systemcomprising a ringing area detector, a local noise power estimator, asmoothing filter, and a mixer. The ringing area detector includes anedge detector, a near edge detector, a texture detector, and a filteringregion decision block. The ringing detection block detects the ringingarea where the smoothing filter is to be applied. The local noise powerestimator controls the filter strength of the smoothing filter. Thesmoothing filter smoothes the input image. The mixer mixes the smoothedimage and the original image properly based on the region informationfrom the ringing area detection block.

Other features and advantages of the present invention will be apparentfrom the following specifications taken in conjunction with thefollowing drawings.

BRIEF DESCRIPTION OF THE DRAWINGS

FIG. 1 shows a block diagram of an image region classification systemused for a mosquito noise reduction system in prior art.

FIG. 2 shows an example block diagram of an embodiment of a mosquitonoise reduction system according to the present invention.

FIG. 3 shows an example diagram of an image divided into non-overlappedP×Q blocks.

FIG. 4 shows an example block diagram of a variation of the mosquitonoise reduction system shown in FIG. 2 according to the presentinvention, which also uses edge map for filtering region decision.

FIG. 5 shows an example block diagram of another variation of themosquito noise reduction system shown in FIG. 2 according to the presentinvention, which uses binary filters to filter the edge map and thetexture map.

FIG. 6 shows an example block diagram of another variation of themosquito noise reduction system shown in FIG. 2 according to the presentinvention, which uses soft switching to control the smoothing filter.

FIG. 7 shows an example block diagram of another variation of themosquito noise reduction system shown in FIG. 2 according to the presentinvention, which uses a delayed edge-block map stored in memory toreduce the hardware complexity.

In the drawings, like reference numbers refer to similar elements.

DETAILED DESCRIPTION OF THE INVENTION

FIG. 2 shows an example block diagram of a mosquito noise reductionsystem 200 according to an embodiment of the present invention. Themosquito noise reduction system 200 comprises a ringing area detectionunit 202 including a gradient computation unit 204, an edge detectionunit 206, a near edge decision unit 208, a texture detection unit 210,and a filtering region decision unit 212. The mosquito noise reductionsystem 200 further comprises a local noise power estimator 216, asmoothing filter 218, and a mixer 220. The local noise power estimator216 includes a high pass filter (HPF) 222, a local standard deviationcalculation unit 224 and a converter 226. The function of the ringingarea detection unit 202 is to detect the ringing region (or mosquitonoise region) where the smoothing filter 218 needs to be applied.

The mosquito noise reduction system 200 can be applied to both theluminance signal Y and the chrominance signals U and V in a videosignal. As such, in general, the input signal is denoted by F(i,j),where i and j are the indices for image row and column respectively.Referring to FIG. 2, the gradient computation unit 204 computes thegradient of the input signal F(i,j), generating the gradient |∇F(i,j)|.Those skilled in the art will recognize that different numerical methodscan be used to calculate the gradient ∇F(i,j) for a given image.

Using the gradient ∇F(i,j), edges and texture information are thenexamined. The edge detection unit 206 detects an edge by comparing themagnitude of the gradient, |∇F(i,j)|, with a threshold value T₁ as:

${E\left( {i,j} \right)} = \left\{ \begin{matrix}{1,} & {{{if}{{\nabla\;{F\left( {i,j} \right)}}}} > T_{1}} \\{0,} & {else}\end{matrix} \right.$

where 1 denotes edge, and 0 denotes non-edge, and T₁ is a predeterminedconstant.

Because the ringing artifacts in compressed video arises near edges, itis necessary to detect whether a current pixel is near an edges or not.For this purpose, the near edge detection unit 208 checks whether thereare edges in the neighborhood of the current pixel. Specifically,considering a P×Q image pixel block and counting the edge samples inthis block, if the block contains more than a certain number of edges,it is assumed that the current pixel is located near edges. Accordingly,let N_(e) ^(P×Q)(i,j) denote the number of edge samples in the P×Q blockaround the current pixel (i,j), then the near edge detection NE(i,j) iscomputed as:

${{NE}\left( {i,j} \right)} = \left\{ \begin{matrix}{1,} & {{{if}\mspace{14mu}{N_{e}^{P \times Q}\left( {i,j} \right)}} > N_{T}} \\{0,} & {else}\end{matrix} \right.$

where 1 denotes near-edge pixel and 0 denotes non-near-edge pixel, whereN_(T) is a predetermined constant.

For simplification, the present invention also proposes to usenon-overlapped blocks for near edge detection. (Note that the block forcomputing N_(e) ^(P×Q)(i,j) assume overlapped blocks or sliding blocksas pixels are processed.) That is, as shown by example in FIG. 3, theimage 300 is divided into multiple P×Q blocks, and then the near-edgeblock NEB (I, J) is detected as:

${{NEB}\left( {I,J} \right)} = \left\{ \begin{matrix}{1,} & {{{if}\mspace{14mu}{N_{e}^{P \times Q}\left( {I,J} \right)}} > N_{T}} \\{0,} & {else}\end{matrix} \right.$

where N_(e) ^(P×Q)(I,J) represents the number of edge samples of the(I,J)th block in which I=└i/P┘, J=└j/Q┘, and └x┘ is the floor functionwhich returns to the integer part of the nonnegative number x . Further,NEB(I,J)=1 implies that the (I,J)th block is a near-edge block. Then,the near edge detection for the current pixel F(i,j) is set as:NE(i,j)=NEB(I,J).

Note that with the use of non-overlapped block, the pixels that belongto the (I,J)th block have the same near-edge information. Thissignificantly reduces computation complexity when computing near-edgedetection compared to the overlapped block based detection where it isassumed that near edge detection needs to be performed for every pixelposition repeatedly.

Referring back to FIG. 2, the texture detection unit 212 detects thetexture TX(i, j) by comparing |∇F(i,j)| with another threshold value T₂,as:

${{TX}\left( {i,j} \right)} = \left\{ \begin{matrix}{1,} & {{{if}{{\nabla\;{F\left( {i,j} \right)}}}} > T_{2}} \\{0,} & {else}\end{matrix} \right.$

where 1 denotes texture, 0 denotes non-texture and T₂ is a predeterminedconstant which is less than T₁.

The filtering region decision unit 214 generates the filtering regionmap R(i,j) based on the supplied near-edge map NE(i,j) and the texturemap TX(i,j). The near-edge map actually already marks the image regionwhere the mosquito noise occurs. However, the near-edge map may alsocontain some texture and filtering texture will cause blurring.Therefore, in order to obtain an accurate filtering region map, textureneeds to be eliminated from the edge-block map. The logic for removingthe texture from the near-edge map is:

${R\left( {i,j} \right)} = \left\{ \begin{matrix}{1,} & {{{{if}\mspace{14mu}{{NE}\left( {i,j} \right)}} = {{1\mspace{14mu}{and}\mspace{14mu}{{TX}\left( {i,j} \right)}} = 0}},} \\{0,} & {{otherwise};}\end{matrix} \right.$

where 1 denotes the filtering region where ringing artifact occurs and 0denotes the non-filtering region.

Since the near edge decision unit needs P lines of image data togenerate the near-edge map, line memory delay may happen at thefiltering region decision unit 214. The filtering region map generatedby the filtering region decision unit 214 is then supplied to the mixer220.

The purpose of the local noise power estimator 216 in FIG. 2 is toprovide the local filter strength σ_(n) to be used by the smoothingfilter 218. The HPF 222 extracts mainly the noise component of the inputsignal. In a preferred embodiment, the following HPF is used:

$\begin{bmatrix}1 & {- 2} & 1 \\{- 2} & 4 & {- 2} \\1 & {- 2} & 1\end{bmatrix}.$

The local standard calculator 224 calculates the local standarddeviation σ_(h) of the high-pass filtered signal HF(i,j) over a r×swindow in the video image as:

${{\sigma_{h}\left( {i,j} \right)} = {\frac{1}{r \times s}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- s}}^{s}{{{H\;{F\left( {{i + m},{j + n}} \right)}} - {\mu_{h}\left( {i,j} \right)}}}}}}},{where}$${\mu_{h}\left( {i,j} \right)} = {\frac{1}{r \times s}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- s}}^{s}{H\;{{F\left( {{i + m},{j + n}} \right)}.}}}}}$

The calculated local standard deviation σ_(h) is forwarded to theconverter 226 which converts the local ringing noise power σ_(h) to theequivalent local additive Gaussian noise power σ_(n). The converter 226implements the following relation:σ_(n)=Bσ_(h) ^(0.45),

where B is a user controlled global filter strength parameter whichcontrols the overall smoothing level of the smoothing filter. Theestimated local noise power σ_(n) is supplied to the smoothing filter218.

The smoothing filter 218 comprises an edge preserving filter whichremoves noise while retaining image edges. The output F_(NR)(i,j) of thesmoothing 218 filter is supplied to the mixer 220. There are manyexamples of edge preserving filters that can be used as the smoothingfilter 218 in the mosquito noise reduction system 200 according to thepresent invention. One example smoothing filter is the weighted sigmafilter, which is defined as:

${{F_{NR}\left( {i,j} \right)} = {\frac{1}{N}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- r}}^{r}{w_{m,n} \cdot {F\left( {{i + m},{j + n}} \right)}}}}}};$${wherein},\text{}{w_{m,n} = \left\{ {{{\begin{matrix}2 & {{{{if}{{{F\left( {{i + m},{j + n}} \right)} - {F\left( {i,j} \right)}}}} < {C_{1}\sigma_{n}}},} \\1 & {{{{if}{{{F\left( {{i + m},{j + n}} \right)} - {F\left( {i,j} \right)}}}} < {C_{2}\sigma_{n}}},} \\0 & {{otherwise};}\end{matrix}N} = {\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- r}}^{r}w_{m,n}}}};} \right.}$

and C₁, C₂ are predetermined constants with 0<C₁<C₂, and σ_(n) is thelocal noise power.

Another example smoothing filter is the minimal mean square error (MMSE)filter. The MMSE filter is defined as:

${{F_{NR}\left( {i,j} \right)} = {{\mu\left( {i,j} \right)} + {\frac{\max\left( {{{\sigma^{2}\left( {i,j} \right)} - \sigma_{n}^{2}},0} \right)}{{\max\left( {{{\sigma^{2}\left( {i,j} \right)} - \sigma_{n}^{2}},0} \right)} + \sigma_{n}^{2}} \cdot \left\lbrack {{F\left( {i,j} \right)} = {\mu\left( {i,j} \right)}} \right\rbrack}}},$

wherein σ_(n) is the local noise power, and, μ(i,j) and σ(i,j) are thelocal mean and local standard deviation computed over a r×r window as:

${{\mu\left( {i,j} \right)} = {\frac{1}{r^{2}}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- s}}^{s}{F\left( {{i + m},{j + n}} \right)}}}}},{{\sigma\left( {i,j} \right)} = {\frac{1}{r^{2}}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- s}}^{s}{{{{F\left( {{i + m},{j + n}} \right)} - {\mu\left( {i,j} \right)}}}.}}}}}$

The mixer 220 in FIG. 2 selects either the output F_(NR)(i,j) of thesmoothing filter 218 or the original input signal F(i,j) as the finaloutput based on the filtering region map, separating the image into tworegions: filtering region and non-filtering region. In the filteringregion, the output F_(NR)(i,j) of the smoothing filter 218 is selectedas the final output. In the non-filtering region, the original inputsignal F(i,j) is selected as the final output.

There are many example variations of the mosquito noise reduction system200 shown in FIG. 2, according to embodiments of the present invention.FIG. 4 shows a block diagram of a system 400 which is a variation of themosquito noise reduction system 200 shown in FIG. 2, wherein the system400 additionally uses the edge map for filtering region decision. Inthis case, edges are included in the filtering region map. With thesupplied edge map E(i,j), the filtering region decision unit 214 in thesystem 400 generates the filtering region map as:

${R\left( {i,j} \right)} = \left\{ \begin{matrix}1 & {{{if}\mspace{14mu}\left( {{{NE}\left( {i,j} \right)} = {{1\mspace{14mu}{and}\mspace{14mu}{{TX}\left( {i,j} \right)}} = 0}} \right)\mspace{14mu}{or}\mspace{14mu}\left( {{E\left( {i,j} \right)} = 1} \right)},} \\0 & {{otherwise};}\end{matrix} \right.$

where 1 denotes the filtering region and 0 denotes the non-filteringregion.

FIG. 5 shows a block diagram of an example system 500 which is yetanother variation of the mosquito noise reduction system 200 shown inFIG. 2, that additionally uses binary filters (BF) 201 to filter theedge map and the texture map. The purpose of using the binary filters201 in the system 500 is to remove the small holes in the edge map andthe texture map. Preferably, the type of binary filters 201 used is themathematical morphology closing operator. If g(i,j) is a binaryfunction, then the closing operator C_(w)(·) is defined as:C _(w)(g)(i,j)=min_(w)(max_(w)(g(i,j))),

wherein W is a window centered at the pixel (i,j), and min_(w)(·) andmax_(w)(·) are the functions that return the minimal and maximal valueswithin the window W, respectively. Preferably, the window W used for theclosing operator is a 5×5 window.

FIG. 6 shows a block diagram of an example system 600 which is yetanother variation of the mosquito noise reduction system 200 shown inFIG. 2, wherein the system 600 uses a soft switching unit 230 to controlthe smoothing filter 218 of the system 600. In the system 600, thefiltering region map R(i,j) first passes through a low-pass filter 203to generate a smoothed version of the filtering region map {tilde over(R)}(i,j). The function of the low-pass filter 203 is to create a smoothtransition between the filtering region and the non-filtering region.With this smoothed region map, hard switching can be replaced by softswitching. Therefore, abrupt changes between the filtered region and thenon-filtered region in the processed image can be avoided using thesystem 600.

In FIG. 6, the soft switching unit 230 uses the smoothed filteringregion map {tilde over (R)}(i,j) to control the switching between theresult of the smoothing filter F_(NR)(i,j), and the original signalF(i,j). The basic idea is to use the filter result F_(NR)(i,j) in thefiltering region (i.e., mosquito noise region) and gradually switchingto the original signal when getting into the non-filtering region (i.e.,non mosquito noise region). The soft switching unit 230 determines thefinal output of the systemF_(OUT)(i,j) as:F _(OUT)(i,j)={tilde over (R)}(i,j)·F _(NR)(i,j)+(1−{tilde over(R)}(i,j))·F(i,j).

Since the near edge decision unit map in a mosquito noise reductionsystem according to the present invention needs P lines of image data togenerate the near-edge map, for line memory constrained hardwareimplementation, system delay may happen. Accordingly, referring to FIG.7, a mosquito noise reduction system 700 using delayed near-edge map isprovided to reduce the hardware complexity. The system 700 is yetanother variation of the mosquito noise reduction system 200 shown inFIG. 2, wherein system 700 in FIG. 7 uses a delayed near-edge map storedin a memory unit (M) 240 to reduce the hardware complexity. In thiscase, the filtering region decision unit uses the delayed near-edge map,which is generated from previous frame and is stored in the memory unit240, and the texture map of the current frame to generate the filteringregion map. Further, the near-edge map of the current frame is passed tothe memory unit 240 instead of the filtering region decision unit 214.The other components of the system 700 are similar to those of thesystem 200 shown in FIG. 2.

The present invention has been described in considerable detail withreference to certain preferred versions thereof; however, other versionsare possible. Therefore, the spirit and scope of the appended claimsshould not be limited to the description of the preferred versionscontained herein.

1. An apparatus for detecting ringing area in a decoded video sequence, comprising: a gradient computation unit that calculates a gradient of the video image; an edge detector that detects an edge region in the video image and generates an edge map, the edge detector using a magnitude of the gradient to determine if a pixel is an edge pixel; a near edge detector that uses the edge map to detect near-edge region and generates a near-edge map; a texture detector that detects texture in the video image and generates a texture map; and a filtering region decision unit that uses the near-edge map and the texture map to generate a filtering region map.
 2. The apparatus of claim 1, wherein the edge detection unit detects the edge region by comparing the magnitude of the gradient of each pixel in the video image with a threshold value, and marks the pixel as an edge pixel if its gradient is greater than the threshold value.
 3. The apparatus of claim 1, wherein the near edge detector detects the near-edge region by counting the edge pixels within a block in the video image, wherein the block contains the current pixel, and marks the current pixel as a near-edge pixel if the number of edge pixels within the block exceeds certain threshold value.
 4. The apparatus of claim 3, wherein said block used for near-edge detection overlaps other blocks in the video image.
 5. The apparatus of claim 3, wherein said block used for near-edge detection is non-overlapping with other blocks in the video image.
 6. The apparatus of claim 1, wherein the texture detection unit detects the texture region by comparing the magnitude of the gradient of each pixel in the video image with a threshold value, and marks the pixel as a texture pixel if its gradient is greater than the threshold value.
 7. The apparatus of claim 1, wherein said filtering region decision unit generates the filtering region map by excluding the texture region from the near-edge region.
 8. A noise reduction system for reducing mosquito noise in a decoded video sequence, comprising: a ringing area detection unit apparatus as claimed in claim 1 that generates a filtering region map for the video image; a local noise power estimator that estimates an equivalent local additive Gaussian noise power from the video image; a smoothing filter that removes unwanted noise from the video image based on said estimate; and a mixer that selects either the result of the smoothing filter or the original input video image as output based on the filtering region map.
 9. The system of claim 8, wherein said local noise power estimator comprises: a high-pass filter that extracts the noise component of the video image signal F(i,j); a local standard deviation calculator that calculates the local standard deviation σ_(h) of the high-pass filtered signal; and a converter that converts the local standard deviation σ_(h) of the high-pass filtered signal into a equivalent local additive Gaussian noise power σ_(n) as said estimate.
 10. The system of claim 9, wherein said local standard deviation calculator calculates the local standard deviation σ_(h) of the high-pass filtered signal HF(i,j) over a r×s window in the video image as: ${\sigma_{h}\left( {i,j} \right)} = {\frac{1}{r \times s}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- s}}^{s}{{{{HF}\left( {{i + m},{j + n}} \right)} - {\mu_{h}\left( {i,j} \right)}}}}}}$ wherein ${\mu_{h}\left( {i,j} \right)} = {\frac{1}{r \times s}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- s}}^{s}{{{HF}\left( {{i + m},{j + n}} \right)}.}}}}$
 11. The system of claim 9, wherein said converter converts the local standard deviation σ_(h) of the high-pass filtered signal into a equivalent local additive Gaussian noise power σ_(n) as: σ_(n)=Bσ_(h) ^(0.45), wherein B is global filter strength parameter which controls the overall smoothing level of the smoothing filter.
 12. The system of claim 8, wherein said smoothing filter comprises a weighted sigma filter that filters the input signal as: ${{F_{NR}\left( {i,j} \right)} = {\frac{1}{N}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- r}}^{r}{w_{m,n} \cdot {F\left( {{i + m},{j + n}} \right)}}}}}},{wherein}$ F_(NR)(i, j)  is  the  filtered  signal; $w_{m,n} = \left\{ {{{\begin{matrix} 2 & {{{{if}\mspace{14mu}{{{F\left( {{i + m},{j + n}} \right)} - {F\left( {i,j} \right)}}}} < {C_{1}\sigma_{n}}},} \\ 1 & {{{{if}\mspace{14mu}{{{F\left( {{i + m},{j + n}} \right)} - {F\left( {i,j} \right)}}}} < {C_{2}\sigma_{n}}},} \\ 0 & {{otherwise};} \end{matrix}N} = {\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- r}}^{r}w_{m,n}}}},} \right.$ and C₁, C₂ are predetermined constants with 0<C₁<C₂, and σ_(n) is the local noise power.
 13. The system of claim 8, wherein said noise smoothing filter comprises a minimal mean square error filter that filters the signal as: ${{F_{NR}\left( {i,j} \right)} = {{\mu\left( {i,j} \right)} + {\frac{\max\left( {{{\sigma^{2}\left( {i,j} \right)} - \sigma_{n}^{2}},0} \right)}{{\max\left( {{{\sigma^{2}\left( {i,j} \right)} - \sigma_{n}^{2}},0} \right)} + \sigma_{n}^{2}} \cdot \left\lbrack {{F\left( {i,j} \right)} - {\mu\left( {i,j} \right)}} \right\rbrack}}},{wherein}$ F_(NR)(i, j)  is  the  filtered  signal; ${{\mu\left( {i,j} \right)} = {\frac{1}{r^{2}}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- r}}^{r}{F\left( {{i + m},{j + n}} \right)}}}}},{{\sigma\left( {i,j} \right)} = {\frac{1}{r^{2}}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- r}}^{r}{{{F\left( {{i + m},{j + n}} \right)} - {\mu\left( {i,j} \right)}}}}}}},$ and σ_(n) is the local noise power.
 14. The system of claim 8, wherein said mixer selects the result of the smoothing filter as the output in the filtering region, and selects the original input signal as the output in the non-filtering region.
 15. A noise reduction system for reducing mosquito noise in decoded video image sequence, comprising: a ringing area detection apparatus as claimed in claim 1 that generates a filtering region map for a video image; a low-pass filter that smoothes the filtering region map to generate a smooth transition between a filtering region and a non-filtering region; a local noise power estimator that estimates the equivalent local additive Gaussian noise power from the video image; a smoothing filter that removes unwanted noise from the video image; and a soft switching unit that gradually switches the smoothing filter on and off based on the smoothed filtering region map.
 16. The system of claim 15, wherein said soft switching unit determines the output F_(OUT)(i,j) of the system as: F _(OUT)(i,j)={tilde over (R)}(i,j)·F _(NR)(i,j)+(1−{tilde over (R)}(i,j))·F(i,j), wherein {tilde over (R)}(i,j) is the smoothed filtering region map, F_(NR)(i,j) is the output of the smoothing filter, and F(i,j) is the original input video image.
 17. An apparatus for detecting a ringing area in a decoded video sequence, comprising: a gradient computation unit that calculates a gradient of the video image; an edge detector that detects an edge region in the video image and generates an edge-map, the edge detector using a magnitude of the gradient to determine if a pixel is an edge pixel; a near edge detector that uses the edge map to detect near-edge region and generates a near-edge map; a texture detector that detects texture in the video image and generates a texture map; and a filtering region decision unit that generates a filtering region map based on the near-edge map, the texture map, and the edge map.
 18. The apparatus of claim 17, wherein said filtering region decision unit generates the filtering region map by excluding the texture region from the near-edge region but keeping the edge region in the near-edge region.
 19. An apparatus for detecting ringing area in a decoded video sequence, comprising: a gradient computation unit that calculates a gradient of the video image; an edge detector that detects an edge region in the video image and generates an edge-map, the edge detector using a magnitude of the gradient to determine if a pixel is an edge pixel; a first binary filter for filtering the edge-map; a near edge detector that uses the edge map to detect near-edge region and generates a near-edge map; a texture detector that detects texture in the video image and generates a texture map; and a second binary filter for filtering the texture map; and a filtering region decision unit that generates a filtering region map based on the filtered near-edge map and the filtered texture map.
 20. The apparatus as claimed in claim 19, wherein the first binary filter comprises a mathematical morphology closing operator, and the second binary filter comprises another mathematical morphology closing operator.
 21. An apparatus for detecting ringing area in a sequence of decoded video frames, comprising: a texture detector that detects texture in a current video frame and generates a texture map; a gradient computation unit that calculates a gradient of the video image; an edge detector that detects an edge region in the current video frame and generates an edge map, the edge detector using a magnitude of the gradient to determine if a pixel is an edge pixel; a near edge detector that uses the edge map to detect near-edge region in the current video frame and generates a near-edge map; a memory unit that stores the near-edge map of the previous video frame; and a filtering region decision unit that generates a filtering region map based on the delayed near-edge map of a previous frame and the texture map of the current frame.
 22. A method of detecting a ringing area in a video image indicating mosquito noise therein, comprising: employing a noise reduction processor for: performing a gradient computation to determine the magnitude of a gradient in the video image; classifying the input image into a mosquito noise filtering region and a non-mosquito noise region using the gradient computation; and using the classification information to adaptively remove the mosquito noise from the video image.
 23. The method of claim 22, wherein the step of classifying further includes the steps of detecting a ringing area in the video image by: detecting edge regions in the video image and generating an edge map; detecting near-edge regions in the video image using the edge map, and generating a near-edge map; detecting texture in the video image and generating a texture map; and generating a filtering region map representing a mosquito noise filtering region, based on the near-edge map and the texture map.
 24. The method of claim 23, before the step of detecting an edge region, such that the step of edge detection further includes comparing the gradient of each pixel in the video image with a threshold value and marking the pixel as an edge pixel if its gradient is greater than the threshold value.
 25. The method of claim 23 wherein the steps of detecting near-edge regions further includes the steps of counting the edge pixels within a block in the video image, wherein the block contains the current pixel, and marking the current pixel as a near-edge pixel if the number of edge pixels within the block exceeds certain threshold value.
 26. The method of claim 25 wherein said block overlaps other blocks used for near-edge detection in the video image.
 27. The method of claim 25 wherein said block is non-overlapping with other blocks used for near-edge detection in the video image.
 28. The method of claim 23, before the step of detecting texture, wherein the step of detecting texture further includes the steps of detecting a texture region by comparing the gradient of each pixel in the video image with a threshold value, and marking the pixel as a texture pixel if its gradient is greater than the threshold value.
 29. The method of claim 23 wherein the step of generating a filtering region map further includes the steps of excluding the texture region from the near-edge region.
 30. A method of reducing mosquito noise in a digital video image, comprising: employing a noise reduction processor for: performing a gradient computation to determine the magnitude of a gradient in the video image; classifying the input image into a mosquito noise filtering region and a non-mosquito noise region using the gradient computation; using the classification information to adaptively remove the mosquito noise from the video image; generating a filtering region map for the video image; determining a local noise power estimator that estimates an equivalent local additive Gaussian noise power from the video image; using a smoothing filter to remove noise from the video image based on said estimate; and selects either the result of the smoothing filter or the original input video image as output based on the filtering region map.
 31. The method of claim 30, wherein the step of determining the local noise power estimate further includes the steps of: using a high-pass filter to extract the noise component of the video image signal pixels F(i,j); calculating a local standard deviation σ_(h) of the high-pass filtered signal; and converting the local standard deviation σ_(h) of the high-pass filtered signal into a equivalent local additive Gaussian noise power σ_(n) as said estimate.
 32. The method of claim 31, wherein the step of local standard deviation calculation further includes the steps of calculating the local standard deviation σ_(h) of the high-pass filtered signal HF(i,j) over a r×s window in the video image as: ${\sigma_{h}\left( {i,j} \right)} = {\frac{1}{r \times s}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- s}}^{s}{{{{HF}\left( {{i + m},{j + n}} \right)} - {\mu_{h}\left( {i,j} \right)}}}}}}$ wherein ${\mu_{h}\left( {i,j} \right)} = {\frac{1}{r \times s}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- s}}^{s}{{{HF}\left( {{i + m},{j + n}} \right)}.}}}}$
 33. The method of claim 31, wherein the step of converting further includes the steps of converting the local standard deviation σ_(h) of the high-pass filtered signal into a equivalent local additive Gaussian noise power σ_(n) as: σ_(n)=Bσ_(h) ^(0.45), wherein B is global filter strength parameter which controls the overall smoothing level of the smoothing filter.
 34. The method of claim 30, wherein said smoothing filter comprises a weighted sigma filter that filters the input signal as: ${{F_{NR}\left( {i,j} \right)} = {\frac{1}{N}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- r}}^{r}{{w_{m,n} \cdot F}\left( {{i + m},{j + n}} \right)}}}}},{wherein}$ F_(NR)(i, j)  is  the  filtered  signal; $w_{m,n} = \left\{ {{{\begin{matrix} 2 & {{{{if}\mspace{14mu}{{{F\left( {{i + m},{j + n}} \right)} - {F\left( {i,j} \right)}}}} < {C_{1}\sigma_{n}}},} \\ 1 & {{{{if}\mspace{14mu}{{{F\left( {{i + m},{j + n}} \right)} - {F\left( {i,j} \right)}}}} < {C_{2}\sigma_{n}}},} \\ 0 & {{otherwise};} \end{matrix}N} = {\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- r}}^{r}w_{m,n}}}},} \right.$ and C₁, C₂ are predetermined constants with 0<C₁<C₂, and σ_(n) is the local noise power.
 35. The method of claim 30, wherein said smoothing filter comprises a minimal mean square error filter that filters the signal as: ${{F_{NR}\left( {i,j} \right)} = {{\mu\left( {i,j} \right)} + {\frac{\max\left( {{{\sigma^{2}\left( {i,j} \right)} - \sigma_{n}^{2}},0} \right)}{{\max\left( {{{\sigma^{2}\left( {i,j} \right)} - \sigma_{n}^{2}},0} \right)} + \sigma_{n}^{2}} \cdot \left\lbrack {{F\left( {i,j} \right)} - {\mu\left( {i,j} \right)}} \right\rbrack}}},{wherein}$ F_(NR)(i, j)  is  the  filtered  signal; ${{\mu\left( {i,j} \right)} = {\frac{1}{r^{2}}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- r}}^{r}{F\left( {{i + m},{j + n}} \right)}}}}},{{\sigma\left( {i,j} \right)} = {\frac{1}{r^{2}}{\sum\limits_{n = {- r}}^{r}{\sum\limits_{m = {- r}}^{r}{{{F\left( {{i + m},{j + n}} \right)} - {\mu\left( {i,j} \right)}}}}}}},$ and σ_(n) is the local noise power.
 36. The method of claim 30, wherein the step of selecting further includes the steps of selecting the result of the smoothing filter as the output in the filtering region, and selecting the original input signal as the output in the non-filtering region.
 37. A method of reducing mosquito noise in decoded video image sequence, comprising: employing a noise reduction processor for: performing a gradient computation to determine the magnitude of a gradient in the video image; classifying the input image into a mosquito noise filtering region and a non-mosquito noise region using the gradient computation; using the classification information to adaptively remove the mosquito noise from the video image; generating a filtering region map for a video image; using a low-pass filter to smooth the filtering region map and generate a smooth transition between a filtering region and a non-filtering region; estimating a local noise power that estimates the equivalent local additive Gaussian noise power from the video image; using a smoothing filter to remove unwanted noise from the video image; and gradually soft switching the smoothing filter on and off based on the smoothed filtering region map to generate an output.
 38. The method of claim 37, wherein the step of soft switching further includes determining the output F_(OUT)(i,j) as: F _(OUT)(i,j)={tilde over (R)}(i,j)·F _(NR)(i,j)+(1−{tilde over (R)}(i,j))·F(i,j), wherein {tilde over (R)}(i,j) is the smoothed filtering region map, F_(NR)(i,j) is the output of the smoothing filter, and F(i,j) is the original input video image. 